Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
نویسندگان
چکیده
Classification using standard statistical methods such as linear discriminant analysis (LDA) or logistic regression (LR) presume knowledge of group membership prior to the development of an algorithm for prediction. However, in many real world applications members of the same nominal group, might in fact come from different subpopulations on the underlying construct. For example, individuals diagnosed with depression will not all have the same levels of this disorder, though for the purposes of LDA or LR they will be treated in the same manner. The goal of this simulation study was to examine the performance of several methods for group classification in the case where within group membership was not homogeneous. For example, suppose there are 3 known groups but within each group two unknown classes. Several approaches were compared, including LDA, LR, classification and regression trees (CART), generalized additive models (GAM), and mixture discriminant analysis (MIXDA). Results of the study indicated that CART and mixture discriminant analysis were the most effective tools for situations in which known groups were not homogeneous, whereas LDA, LR, and GAM had the highest rates of misclassification. Implications of these results for theory and practice are discussed.
منابع مشابه
Model Selection for Mixture Models Using Perfect Sample
We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...
متن کاملInference on Pr(X > Y ) Based on Record Values From the Power Hazard Rate Distribution
In this article, we consider the problem of estimating the stress-strength reliability $Pr (X > Y)$ based on upper record values when $X$ and $Y$ are two independent but not identically distributed random variables from the power hazard rate distribution with common scale parameter $k$. When the parameter $k$ is known, the maximum likelihood estimator (MLE), the approximate Bayes estimator and ...
متن کاملComparison of MCNP4C, 4B and 4A Monte Carlo codes when calculating electron therapy depth doses
ABSTRACT Background: accurate methods of radiation therapy dose calculation. There are different Monte Carlo codesfor simulation of photons, electrons and the coupled transport of electrons and photons. MCNPis a general purpose Monte Carlo code that can be used for electron, photon and coupledphoton-electron transport.Monte Carlo simulation of radiation transport is considered to be one of the ...
متن کاملPrediction Based on Type-II Censored Coherent System Lifetime Data under a Proportional Reversed Hazard Rate Model
In this paper, we discuss the prediction problem based on censored coherent system lifetime data when the system structure is known and the component lifetime follows the proportional reversed hazard model. Different point and interval predictors based on classical and Bayesian approaches are derived. A numerical example is presented to illustrate the prediction methods used in this paper. Mont...
متن کاملSimulation-Based Radar Detection Methods
In this paper, radar detection based on Monte Carlo sampling is studied. Two detectors based on Importance Sampling are presented. In these detectors, called Particle Detector, the approximated likelihood ratio is calculated by Monte Carlo sampling. In the first detector, the unknown parameters are first estimated and are substituted in the likelihood ratio (like the GLRT method). In the sec...
متن کامل